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Abstract 

We consider the problem of testing hypotheses on the copula density from n bi- 
dimensional observations. We wish to test the null hypothesis characterized by a 
parametric class against a composite nonparametric alternative. Each density under 
the alternative is separated in the L2-norm from any density lying in the null hypoth- 
esis. The copula densities under consideration are supposed to belong to a range of 
Besov balls. According to the minimax approach, the testing problem is solved in an 
adaptive framework: it leads to a log log term loss in the minimax rate of testing in 
comparison with the non-adaptive case. A smoothness-free test statistic that achieves 
the minimax rate is proposed. The lower bound is also proved. Besides, the empirical 
performance of the test procedure is demonstrated with both simulated and real data. 



Index Terms — Adaptation, Copula Density, Minimax Theory of Test, Goodness 
Test of Fit. 
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1 Introduction 



Copulas became a very popular and attractive tool in the recent literature for modeling 
multivariate observations. The nice feature of copulas is that they capture the struc- 
ture dependence among the components of a multivariate observation without requiring 
the study of the univariate margins. More precisely, Sklar's Theorem ensures that any 
d— varied distribution function H may be expressed as 



where the F^'s are the margins and C is called the copula function. [21] states the existence 
and the uniqueness of C as soon as the random variables with joint law H are continuous. 

Modeling the dependence is a great challenge in statistics, specially in finance or as- 
surance where (for instance) the identification of the dependence structure between assets 
is essential. Many authors proposed parametrical families of copulas {C\,X G A}, each 
of them being available to capture different dependence behavior. The elliptic family 
contains the Gaussian copulas and the Student copula which are often used in finance. 
For insurance purposes, heavy tails are needed and copulas coming from the archimedian 
family are used. Among others, the more common are the Gumbel copula, the Clayton 
copula or the Frank copula. In view to illustrate the different behaviours of the tails of 
several copula densities, some graphs corresponding to the models cited above are pre- 
sented below. The parameters are chosen such a way that the associated Kendall's tau 
(i.e. the indicator of concordance/discordance) is identical in all illustrations. 



Figure 1: Kendall's tau= 0.25. Left: Bi-dimensional Gaussian copula density with parame- 
ter p = 0.4. Right: Bi-dimensional Student copula density with parameter (p, v) = (0.4, 1). 

Since many parametric copula models are now available, the crucial choice for the 
practitioner is to identify the model which is well-adapted to data at hand. Many goodness- 
of-fit tests are proposed in the literature. [14] give an excellent review and propose a 
detailed empirical study for different tests: we refer to this paper for any supplementary 
references. Roughly speaking, they study procedures based on empirical processes. Among 
others, they deal with rank-based versions of the Cramer- von- Mises and Kolmogorov- 
Smirnov statistics. They also consider test based on Kendall's transform. Basically, they 
restrict themselves to test statistics built from empirical distributions (empirical copula 
or transform of this latter). On a theoretical point of view, the asymptotic law under the 
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Figure 2: Kendall's tau= 0.25. Left: Bi-dimensional Prank copula density with parameter 
6 = 2.5. Center: Bi-dimensional Gumbel copula density with parameter 6 = 1.33. Right: 
Bidimensional Clayton copula density with parameter 6 = 0.66. 

null of the test statistic is stated in a number of papers (see by instance [5], [6] and [7]). It 
allows in particular to derive the critical value but generally the alternative is unspecified 
and the properties on the power are empirically given from simulations. 

In our paper, it is supposed that the copula C admits a density copula c with respect to 
the Lebesgue measure. To our knowledge, [8] was the first author to propose a goodness- 
of-fit test based on nonparametric kernel estimations of the density copula. In the same 
spirit as the papers cited above, he derived the asymptotic law of the test statistic under 
the null. His results are valid for bandwidths greater than n"^/^^"'"'^^ which correspond to 
enough smooth copula densities. 

Here, we focus on the minimax theory framework: we define the test problem as 
initiated by [16]. One of the advantages of this point of view is to precisely define the 
alternative: it is then possible to quantify the risk associated with the test problem as the 
sum of the first type error and the second type of error. Since this risk measure provides a 
quality criterion, it is then possible to compare the test procedures. Indeed, the alternative 
Hi{vn) is defined from a positive quantity Vn measuring the distance between the null and 
the latter. Obviously, the larger is this separating distance, the easier is the decision. The 
aim of the minimax theory is to determine the larger alternative for which the decision 
remains feasible. Solving the lower bound problem is equivalent to exhibit the faster 
separating rate Vn such that the risk is bounded from below by a given positive constant 
a: this rate is called the minimax rate of testing. Next, the upper bound problem 
has to be solved exhibiting a test procedure whose risk is bounded from above by a given 
a, that is, the statistic test allows to distinguish the null from Hi{vn), where f„, is the 
minimax rate. 

In the white noise model or in the density model, the goodness-of-fit problem (stands 
as explained above) was solved for different regularity classes (Holder or Sobolev or 
Besov) associated with various geometries: pointwise, quadratic and supremum norm. 
For fixed smoothness of the unknown density (minimax context), there is a rich liter- 
ature summed-up in [IT] and in [19]. Optimal test procedures include orthogonal projec- 
tions, kernel estimates or procedures. Goodness-of-fit tests with alternatives of variable 
smoothness into some given interval (adaptive context) were introduced by [25] for the 



L2-norm in the Gaussian white noise model and generahzed by [26] to Lp-norms. [18] 
proved that a collection of tests attains the adaptive rates of goodness-of-fit tests in 
L2-norm as well as for the density model. 

For sake of simplicity, we restrict ourselves to bi-dimensional data but there is no 
theoretical obstacle to generalize our results to higher dimensions. Suppose that we observe 
n i.i.d. copies (Xj,yi)jgx where X = {1, . . . , n} of [X^Y). The random vector (X, y) is 
drawn from the distribution function K expressed through the copula C . Moreover, it 
is assumed that C has a copula density c with respect to the Lebesgue measure and F 
and G stand for the cdf's of X and Y respectively. From {Xi^Yi)i^Xi we are interested in 
studying the goodness-of-fit problem when the null is a composite hypothesis ffo : c G Ca 
for a general class Ca of parametrical copula densities. Since the alternative is defined 
from the quadratic distance, we propose a goodness-of-fit test based on wavelet estimation 
of an integrated functional of the copula density. Indeed, [12] and [1] show that the 
wavelet methods are an efficient tool to estimate the copula densities since these latter 
have very specifics behaviors. Unfortunately no direct observations {F{Xi)^G{Yi)) for 
z G T are available since F and G are unknown, the test statistic is then built with pseudo- 
observations G'(yi))jgj: as usual in the copula context, the quantities of interest 
are rank-based statistics. We provide an auto-driven test procedure and we produce its 
rate when the alternative contains a regular constraint: since the procedure is based on 
wavelet methods, the linked functional classes are the Besov classes Bs^p^q- We give results 
for p > 2 (dense case) and s > 1/2. The constraint s > 1/2 is due to the fact that pseudo- 
data are used and then a minimal regularity is required in order to pay no attention to 
substitute the direct data with the ranked data. Observe that [20] have the same constraint 
in the univariate regression model when the design is random with unknown distribution. 
Next, we prove that our procedure is minimax (and adaptive) optimal by exhibiting the 
minimax adaptive rate. This one looks like the minimax rate but an extra log log term 
appears: we prove that this loss is the price to paid for adaptivity. To our knowledge, 
the proof of the adaptive lower bound in the multivariate density model when the null is 
composite has never been clearly written. 

Next, we allocate a part to empirical studies. Simulation allows us to show that, when 
the theoretical framework is respected, the power qualities of our test procedures are good. 
We choose to make simulations starting from the parametrical copula families presented 
at the beginning of the introduction and which are the more common for applications. We 
compare our simulation results with those of [14] . Then, we study a very well known sample 
of real life data of [9] consisting of the indemnity payment (LOSS) and the allocated loss 
adjustment expense (ALAE) for 1500 general liability claims. The most popular model for 
the copula is a Gumbel copula model with parameter 6 = 1.45 (which may be estimated 
by inverting the Kendall's tau) given in Figure 3. Among other results, it is empirically 
shown that the Gumbel and the Gaussian copula models are acceptable while Student, 
Clayton or Frank models are rejected. Figure 3 gives a wavelet estimator of the copula 
density of {LOSS, ALE A) by the method explained in [1]. Visually, fitting the unknown 
copula with the Gumbel model seems indeed to be the most appropriated. 

The paper is organized as follows. In Section 2, we first provide a general description of 
orthonormal wavelet bases, focusing on the mathematical properties that are essential to 
the construction of the statistics that we consider. In Section 3, we provide the inference 
procedures: first, we explain how to estimate the square L2-norm of the copula density 
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Figure 3: Left: Thresholded wavelet estimator for the copula density of {LOSS, ALE A) 
as given in [1]. Center: Gumbel copula density with parameter 9 = 1.45. Right: Gaussien 
copula density with parameter p = 0.48. 

and next we derive the procedure of goodness-of-fit. The theoretical part is exposed in 
Section 4: first, we state very precisely the test problem under consideration; we define 
the criterion allowing to measure the quality of test procedures and define the separating 
minimax rate. In Section 5, the main results are stated: our test procedure is shown to 
be optimal in the sense defined in the previous section. Section 6 is devoted to practical 
results with both simulated and real data. We conclude these parts with a discussion in 
Section 7. The proof of the upper bound is given in Section 8 while the proof of the lower 
bound is given in Section 9. Finally, all technical or computational lemmas which are not 
essential to understand the main proofs, are postponed in appendices. 

2 Wavelet Setting 
2.1 Wavelet expansion 

In the univariate case, we consider a wavelet basis of L2{[0, 1]) (see [4]). Let cp be the scaling 
function and let ip be the same notation for the associated wavelet function and its usual 
modifications near the frontiers and 1. They are chosen compactly supported on [0,L], 
L > 0. Let j in IN, ki in "ZL and for any univariate function set = 2-'/^$(2-' •— fci). 

In the sequel, we use wavelet expansions for bivariate functions and we keep the same 
notation as for the univariate case. Then, a bivariate wavelet basis is built as follows: 

4'j,k{^,y) = 4'jMi^)'^jM(.y)^ '^fli^^y) = 4'jMi^)'^jMiy)' 

(2) (3) 

'>Pj,k(^^y) = V'i,fci(^)'Aj,fc2(2/), '^j.ki^^y) = V'i,fci(a^)V'i,fc2(y)' 

where the subscript k = {ki, ^2) indicates the number of components of the functions (j)j^k 
and ipj,k- For a given j G IN, the set 

ihk^i^lk', ^ > J, (k, k')G7L^ x7L^,e = 1,2,3} 
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is an orthonormal basis of L^([0, 1]^) and the expansion of any real bivariate function ^> 
in L^([0, 1]^) is given by: 



where the scahng coefficients and the wavelet coefficients are 

VjG]N,VA;G^2^ A,^k = [ ^4>j,k, ^1, fc = / "^i^lk- 

The Parseval Equality immediately leads to the expansion of the square L2-norm of the 
function 

= T,+B,, (1) 
where the trend and the detail terms are respectively: 

oo 3 

= J2 (^^•.'^•)' and B,=Y^ J] ^{31,)'. (2) 

feeffi^ ken? <:=i 

Notice that, since the support of $ is [0, 1]^, the sum over the indices k is finite: there are 
no more than (1? + Vf' terms in the sum (recall that L is the length of the support of <?!)). 
In order to simplify the notations, the bounds of variation of k and e in expansion of any 
are omitted in the sequel. 

2.2 Besov Bodies and Besov spaces 

Dealing with wavelet expansions, it is natural to consider Besov bodies as functional spaces 
since they are characterized in term of wavelet coefficients as follows. 

Definition 1. For any s > 0, p > 1 and any radius M > 0, a d— varied function $ 
belongs to the ball bg^p^ooiM) of the Besov body 6<j,p,oo if end only if its sequence of wavelet 
coefficients Bj ^ satisfies 

3 

Vj e IN, ^ ^ \Blk\P < M2-^'("+"'/2-d/p)p^ 

The Besov body 6s,p,oo coincides with the more standard Besov space Bg^p^oo when 
there exists an integer N strictly larger than s and such that the g— th moment of the 
wavelet vanishes for any q = 0, . . . , — 1. It is possible to build univariate wavelets 
whose support is included in [0, 2N — 1] satisfying this property for any choice of N (see 
the Daubechies wavelets). 

In the sequel, we need to bound the detail term Bj defined in (2). We use the following 
inequality 

oo / 3 \'/^ 

v.GiN, b,<y: [y: ^fer (K2^^y-^/^ 



6 



where X is a positive constant depending on the supports of $ and "0. Assuming that 
the function <I> belongs to 6s,p,oo(Af) with s,p and M as in Definition 1, the fohowing 
inequahty holds 

Vj G IN, Bj < k 2-2^'^, (3) 

where K \s a, positive constant depending on the supports oi ij: and on the radius M. 
When $ is a copula density, K = M'^/p (3(L + l)^)^"^/^. 

3 Statistical Procedures 

Assuming that the copula density c belongs to -^^2([0, 1]^), we first explain the procedure 
to estimate the square L2— norm of c 




which is used to define the alternative of the goodness-of-fit test. The statistical methods 
depend on parameters (the level j for the estimation procedure and j and the critical 
value tj for the test procedure) which are discussed and determined in an optimal way in 
Section 5. 

It is fundamental to Notice that, for any bivariate function one has 

IE,[^{U,V)\ = ]Eh[^{F{X),G{Y))], (4) 

where h stands for the joint density of {X,Y). This means in particular that the wavelet 
coefficients {cj^k^c^k^ ^ > j,k ^ 7L^,e = 1,2,3} of the copula density c on the wavelet 
basis 

Wj,k,i^lk, i>j,ke7L^,e = 1,2,3} 
are equal to the coefficients of the joint density h on the warped wavelet family 

WjM^i-), G{-)),i;l,{F{-),G{-)), £>j,ke7Z.^e = 1, 2, 3}. 

The statistical procedures are based on the wavelet expansion of the copula density c, for 
which the wavelet coefficients have to be estimated. 

3.1 Procedures to estimate 9 

Let J be a subset of INT and consider a given j in J. Motivated by the wavelet expansion 
(1), we propose to estimate 9 with an estimator of the trend Tj omitting the detail term 
Bj. Using the orthonormality property of the wavelet basis, it leads to estimate the square 
of the coefficients of the copula density on the scaling function. As usual, a [/—statistic 
associated with the empirical coefficients is used in order to remove the bias terms. Due 
to (4), we first consider the following family of statistics {Tj,j £ J} defined by 

= '^^j,k, 

k 
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where 9j^k is the following [/—statistic 

1 " 

= -f TT E hkiFiXi,),G{Yi,))4>j,k{F{Xi,),G{Yi,)). 

nin — 1) ^-^ 

«1,«2 = 1 

Since no direct observation (F{Xi), G{Yi)) is usually available, it is replaced in 0j ^ by the 
pseudo observation {F{Xi), G{Yi)), where F, G denote some estimator of the margins. To 
preserve the independence given by the observations, we split the initial sample (X,, l^)jgx 
into disjoint samples {Xi,Yi)i^j-^ and {Xi,Yi)i^j2 withT2UXi = X, X2nXi = 0, and whose 
size is ni and n2 respectively. The sub-sample with indices in Xi is used to estimate the 
marginal distributions and the second one with indices in X2 is devoted to the computation 
of the [/-statistic. We consider the usual empirical distribution functions: 

It leads to the family {Tj,j £ J} of estimators of 9 



with 

n2(n2 — 1) ^-^ V '^i '^1 / V ^1 ^1 



where Rp = niF{Xp) and Sp = niG{Yp), p G Xi, could be viewed as estimates of the rank 
statistics of Xp and Yp respectively. 



3.2 Test Procedures 

In this part, we consider a family of known bivariate copula densities Ca = {cx, A G A} 
indexed by a parameter A varying in a given set A C IR'^'^ , d/i^ G IN* . From the observations 
(Xi,Yi)i^x, our aim is to test the goodness-of-fit between any cx and a copula density c, 
which is enough distant in the L2-norm, from the parametric family C\. Acting as in 
paragraph 3.1, we estimate the square L2-norm between c and a fixed element cx lying in 
the family Ca by 

TjW = EMA), (5) 

A: 

for 

MA) = -T^—r, E f'^.^f— 

712(712 - 1) . ^ V V '^i '^i / / 

n7^«2 
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where {cj.fc(A), fc G G IN} denote the known scahng coefficients of the target copula 

density cx. Notice that, if direct observations {F{Xi),G{Yi))i^j would be available, the 
appropriate test statistic Tj{X) would be 

k 

where 

MA) = , ^ E {^,,k{F{X,,),G{Y,,))-c,.k{X)) 

"2(^2 - 1) ^^^^^^ 

x(,^,-fc (F(Xi,),G(yiJ)-c,,fc(A)). 

Now we are ready to build the test procedures. Let us give a set of indices J and a set of 
critical values {tj, j G J} and define {Dj',j G J}, the family of test statistics 

^ infr,(A)>t. ' 

allowing to test if c belongs to the parametric family C\ = {cx,X G A}. Notice that 
A = {Ao} leads to the single null hypothesis Hq : c = cxq- We are also interested in 
building auto-driven procedures by considering all the tests in the family 

D\ = max = 1 / . r ^ / x n ^ n r, • (6) 
jej ^ max(mf r,(A) -tj) > ^ ^ 

jGJ AgA 

The sequence of parameters tj of the method are determined in an optimal way in Section 
5. We explain in Section 4 what "optimal way" means in giving a presentation of the 
minimax theory for our framework. 



4 Minimax Theory 

We adopt the minimax point of view to solve the problem of hypothesis testing, initiated 
by [16] in Gaussian white noise. A review of results obtained in problems of minimax 
hypothesis testing is available in [17] and [19]. Let us describe this approach. 

4.1 Minimax hypothesis testing Problem 

As in the previous section, we consider C\ = {cx, A G A} a given functional class of copula 
densities. For any given r = {s,p,M), with s > 0,p> 1,M > 0, the following statistical 
problem of hypothesis testing is considered. 

Ho : c = Cx e Ca against Hi : c € T{vniT)), (7) 

with 

r(vn(T)) = 6s,p,oo(M) n\c: inf ||c - ca|| > Vnir) } , 
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where &s,p,oo(-^) is the bah of radius M of the Besov body bg^p^oa defined in Definition 1 and 
Vn{T) is a sequence of positive numbers, depending on r and decreasing to zero as n goes 
to infinity. Recah that H^H denotes the L2-norm of any function g in L2([0, 1]^). Observe 
that the functional class r(f„(r)), which determines the alternative ffi, is characterized 
by three parameters: the regularity class 6s,p,oo where the copula density is supposed to 
belong, the L2-norm which is the geometrical tool measuring the distance between both 
hypotheses, and the sequence Vnij). 

According to the principle of the minimaxity, the regularity space and the loss function 
are chosen by the statistician. Notice that the parameter r could be known or unknown. 
Obviously, our aim is to consider tests which are able to detect alternatives defined with 
sequences Vn{T) as small as possible. It can be shown ([17]) that Vnir) cannot be chosen 
in an arbitrary way: indeed, if Vn{T) is too small, then Hq and Hi cannot be distinguished 
with a given error a G (0, 1). Therefore, solving hypothesis testing problems via the 
minimax approach consists in determining the smallest sequence Vn{T) for which such a 
test is still possible and to indicate the corresponding test functions. The smallest sequence 
Un(r) is called the minimax rate of testing. Let be a test statistic i.e. an arbitrary 
function with possible values 0, 1, measurable with respect to {Xi,Yi)i^x and such that we 
accept Hq if Dn = and we reject it if Dn = 1. 

Definition 2. Assuming t to he known, the sequence Vn{T) is the minimax rate of testing 
Hq versus Hi if relations (8) and (9) are fulfilled: 

• for any given ai G (0, 1), there exists a > such that 



where the infimum is taken over any test statistic Dn, 

• there exists a sequence of test statistics {Dn)n for which for any given 02 in (0, 1), 
it exists ^ > such that 



where IPc, respectively IP\ denotes the distribution function associated with the copula 
density c, respectively with cx. 

4.2 Adaptation 

Nevertheless, since the copula function itself is unknown, the a priori knowledge on r could 
appear unrealistic. Therefore, the purpose of this paper is to solve the previous problem 
of test in an adaptive framework i.e. in supposing that r = (s,p,M) is unknown but 
varying in a known set S. Comparing the adaptive case with the non-adaptive case, it 
has been proved in different frameworks that a loss of efficiency in the rate of testing is 
unavoidable (see for instance [25], [10]). This loss is expressed as tn, a positive constant 
or a sequence of positive numbers increasing to infinity with n (as slow as possible), which 
appears in the rate of testing v^^-i{t). Similarly to the minimax rate of testing, we define 
the adaptive minimax rate of testing as follows. 






(9) 
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Definition 3. The sequence v^^-i{t) is the adaptive minimax rate of testing if relations 
(10) and (11) are satisfied 

• for any given ai G (0, 1), there exists a > such that 



lim inf sup ]Px{Dn = 1) + sup sup iPcpn = 0) > ai, (10) 
where the infimum is taken over any test statistic Dn, 

there exists a sequence of universal test statistics (free of t) such that, for any 
given a2 in (0, 1), there exists A> such that 



lim sup iPA(£'* = 1) + sup sup PciD^ = 0) ] < a2 (11) 

"^+°^\cAeCA rG5cGr(A,; _l(r)) 

where tn is either a positive constant or a sequence of positive numbers increasing to 
infinity with n as slow as possible. 

Notice that relations (10) and (11) (instead of relations (8) and (9)) mean that the 
minimax rate of testing Un(T") is contaminated by the term t„ in the adaptive setting. 
Observe that the same phenomenon is observed in the estimation problem where an extra 
logarithm term t„ = log(n) has often (but not always) to be paid for the adaptation. 



5 Main results 

In this section, we focus on test problems for which the parametric family C\ is included 
in some 6sa,pa,oo(^^a) where sa > 0, pA > 1 and Ma > are known. 

Our theoretical results concern the minimax resolution of the problem of hypothesis 
testing defined in (7) in an adaptive framework. Theorem 1 states the result of the 
lower bound (see relation (10)). Then, Theorem 2 exhibits the rate achieved by the test 
procedure proposed in Section 3 (see relation (11)). Comparing the rate of our procedure 
with the fastest rate given in Theorem 1 leads to Theorem 3 establishing the optimality 
of our procedure. 

First, let us state the assumption which gives a control of the complexity of Ca- 
• AO: the set A is compact in IR'^'^ and 

sup \cx{x,y) - CA'(x,y)| < QUA - A'||J[^dA, VA, A' G A, 

(x,j/)G[0,l]2 

where is a positive real, Q is a positive constant and || • Hj^dA denotes the Euclidean 
norm m 
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5.1 Lower Bound 



As it is usual for composite null hypotheses, the result of the lower bound requires the ex- 
istence of a particular density G Ca (see assumption AInf below) in order to construct 
a randomized class of functions which must be included in the alternatives. 

• AInf: there exists a parameter Aq in A such that 

\/{u,v) G [0,1]^, cxq{u,v) > m, with m > 0. 

Theorem 1. Suppose that S defined by 

S = {t = {s,p,M),s> 1/2, p>2,M > : s - 2/p < sa - 2/pA, Ma < M} (12) 

is nontrivial (see [.l')]), which means that there exist p > 2, M > and < s^^^^ < Smax 
such that 

^« ^ [smiw^max], {s,p,M) G S 
and assume that AO and AInf hold. Set 

^nt-(^) = (nt-^)-2^/(^^+2) With tn = Vl0g(l0g(n)). 

Then, it exists a positive constant a such that 



lim inf{supiPA(£'n = 1) +sup sup PdDn = 0)} \ = 1, (13) 

n-»+oo \ D„ xeA re5cer(aj; _i(t)) 

where the infimum is taken over any test function Dn- 
5.2 Upper Bound 

Theorem 2 deals with relation (11) which holds for the test statistic Da defined by 
relation (6) as soon as the parameters of the methods are chosen as follows. The set 
J = { LjoJ , • • • , LJooJ } is determined by 

2-=l„gMlog(„0. 2'~ = (i^) A(i^) . (") 

where q is the order of differentiability of the scaling function (p. The critical values satisfy 

2i 

Vj G J, tj = 3fi — A/log log(n2), (15) 
n2 



where ^ is a positive constant such that /i > y^2KgKi, and Kg and Ki are positive 
constants depending on ||i?^||oo) ||c||oo) I|ca||oo and the length of the support of (see 
Lemma 3). 
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Theorem 2. Let us choose ni = n n and 1x2 = (1 — 7r)n for some tt in (0, 1). Assume 
that the scaling function (j) is continuously q— differentiahle for 



> 



log 



log ("2) 



log 



rii 
log(ni) 



Moreover assume that any density c under the alternatives or any c\ under the null are 
uniformly hounded. Then, the test statistic D\ defined by (6) is such that 



lim sup ]Px{Dp, = l) = 0. 



Assume that AO holds, then there exists a positive constant A such that 

0, 



lim sup sup ]Pc{D\ = 0) 

"i^"2-^+°°TeScGr(Ai) _i(r)) 



(16) 



(17) 



where 



i(r) = (n20-2^/(^^+2) and = Vlog(log(n2)). 



Relation (11) of the upper bound holds since both relations (16) and (17) are satisfied. 
Notice also that relation (16) indicates that the test statistic Da is asymptotically of any 
level in (0,1). 

5.3 Optimality 

As a corollary of Theorem 1 and Theorem 2, we obtain 

Theorem 3. Under the assumptions of Theorem 1 and Theorem 2, our test procedure 
defined by Relation (6) is adaptive optimal over the range of parameters t £ S where S is 
defined by equation (12). 



6 Practical results 

The purpose of this section is to provide several examples to investigate the performances 
of the test procedure presented in Section 3. This part is not exactly an illustration of 
the theoretical part since it does not focus on the separating rate between the alternative 
and the null hypothesis, but it is devoted to the study of our test procedure from a risk 
point of view. Note also that we do not use exactly the theoretical procedure described 
in the previous section. As usual for practical purpose, we replace theoretical quantities 
by more adapted quantities obtained with resampling methods. In the first part, we fix 
the test level a = 5% and we study the empirical power function. In the second part, we 
present an application to some economical series. 
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6.1 Methodology 



On the contrary to the estimation problem, a smooth wavelet is not needed. The test 
statistic is then computed with the Haar wavelet since it has a small support and then 
it leads to a fast computation time. The critical value of the test is determined with 
bootstrap methods: the standard deviation of the test statistic is computed thanks to 
Nboot = 20 resampling. The size of the simulated samples is n = 2048 which is reasonable 
for bi-dimensional problems in an asymptotic context. For the real life data example, the 
number of data is around n = 4000. For the simulation part, the empirical level of the 
test is derived from Nmc = 500 replications for each test problem. 

6.2 Simulations 

The setup of our simulations is closely related to the work of [14], except that they con- 
sider small samples (of size 150) since their test procedures are based on the empirical 
copula distribution (and thus generate parametrical rates). To explore various degrees of 
dependance, three values of Kendal's tau are considered, namely r = 0.25,0.50,0.75 for 
the following copula families: Clayton, Gumbel, Frank, Normal and Student with four de- 
grees of freedom (df). Calculations are made with the MatLab Sofware. The results of the 
simulations are presented in Table 1. For an easier reading, the estimated standard errors 
of the empirical powers are presented in italics. Furthermore, for each testing problem 
we highlighted the estimated errors of the first type (estimators of a = 0.05) using bold 
characters. In brackets, we give the results obtained by [14] with their test procedures, 
denoted CvM and built on rank-based versions of the familiar Cramer-von Mises statis- 
tics. It would be also possible, if one is interested in, to compare with the different test 
procedures (based on the empirical copula distribution) proposed also by [14]. 
Let us now summarize the conclusions made from the simulation results. 

• Our test is degenerated: we almost always accept Hq (when Hq is true) while the 
procedure of Genest et al. [14] produces an excellent estimation of the prescribed 
level a. It is a characteristic of the adaptive minimax procedures. 

• For small level of dependence r = 0.25, our procedure is very competitive and 
produces (almost) always a better empirical power than the CvM test. The results 
are spectacular when the fit caq is a Student (4). 

• When a large Kendal's tau is considered, our procedure fails when the data are 
issued from a Clayton copula density. The procedure is not available to recognize a 
structure of dependence modeled with a Clayton. 

• The improvment of our results with respect to the CvM test is decreasing with the 
Kendal's tau. The CvM test becomes better when the tau is increasing whereas for 
us it is the opposite. 

In conclusion, we recommend the use of our test procedures when the Kendal's tau is 
not too large since it seems to outperform the existing procedures based on the copula 
distribution. This situation corresponds to our theoretical setup related to the functional 
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spaces in which the unknown copula density is supposed to hve. Unfortunately, the prac- 
tical results do not give hope for using this procedure when the copula densities present 
high peaks (as it is the case for the Clayton copula density with a large tau). 

6.3 Real data 

We present now an application to real data of our test procedure. The level of each 
test (with simple null hypothesis or multivariate null hypothesis) is a = 5%. To obtain 
the empirical level, = 50 replications of our procedure computed with the half of the 
available data (chosen randomly) is used. Table 2 gives the empirical probability to reject 
the null hypothesis and the final decision. "Yes" means that we accept that the structure 
of dependence belongs to the considered family and "No" that we reject the fitting. 

We consider the data of [9], which were also analyzed by [11], [21], [3] and [13], among 
others. The data consist of the indemnity payment (LOSS) and the allocated loss adjust- 
ment expense (ALAE) for 1466 general liability claims. 

We consider the following test problems: 

Hq: cGCa 

where the parametrical family Ca is described in Table 2. Since the Kendall's tau computed 
with the sample is r = 0.31, we choose an adapted grid of parameters for each parametrical 
family of copula densities. Next, assuming that the density copula of the data belongs to 
a fixed parametric family, we estimate the parameter A 

• by A in inverting the Kendall's tau (third part of Table 2 where Hq : c = c^). 

• by A in minimizing the average square error (ASE) computed thanks to the bench- 
mark given in Figure 3 (fourth part of Table 2 where Hq : c = c^). For information, 
we give the relative ASE computed with into brackets. 

The various authors who analyzed this data set concluded that the Gumbel copula pro- 
vides an adequate representation of the underlying dependence structure. The Gumbel 
parametric family of extreme- value copulas captures the fact that almost all large indem- 
nity payments generate important adjustment expenses (e.g., investigation and legal costs) 
while the effort invested in the treatment of a small claim is more variable. Accordingly, 
the copula exhibits positive but asymmetric dependence. Confirming this result, the adap- 
tive method of estimation proposed by [1] provides a benchmark (see Figure 3) for the 
copula density associated with the data. 

7 Discussion 

The paper is mainly devoted to construct an optimal procedure for solving a general 
nonparametric problem of test: both hypotheses are composite, very general parametric 
family could be considered under the null. Our procedure is proved asymptotically to be 
adaptive minimax and the minimax separating rate is exhibited over a range of Besov 
balls. 
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Thanks to the simulations and a appUcation to real data, our procedure seems to be 
competitive on the power point of view even if the setting of test under consideration is, 
in the simulation study, clearly parametric. 

It is worthwhile to point out that the copula model requires more regularity (than the 
usual density model) since the approximation due to the rank-based statistics needs to be 
accurate enough (see Lemma 4). 

One must notice that only copulas densities belonging to dense Besov spaces (i.e. 
defined with a parameter p larger than 2) are under consideration in this paper although 
several copula densities with a strong dependence structure belong to sparse Besov 
spaces (i.e. defined with a parameter p smaller than 2). As it is illustrated in the simula- 
tion study, our test procedure fails for the Clayton copula density with large parameters. 
This density is suspected to belong to a sparse Besov ball. The study of sparse Besov 
balls would require the determination of a new test strategy which would lead to another 
minimax rate of testing: these objectives are beyond those of the present paper and will 
be explored in a further work since the set of copulas densities contains a number of sparse 
functions. For sparse Besov balls and in the white noise model for testing the existence of 
the signal, [22] proved that the minimax testing rate in the sparse and the dense cases is 
different. They also proved that it is possible to built an adaptive minimax (non linear) 
procedure of test for the sparse case. 

A very close problem is the sample comparison test (problem with two samples). It 
could be interesting to test if the structure of dependence between a couple of variables 
Vi = {X, Y) is the same as for another couple V2 = (Z, T). This problem of tests could be 
stated as follows: 



Ho : cvi = CV2 against Hi : (0^1,0^2) £ r(t;„(r)), 

with 

,pi,oo (M2)} 
n{(cvi,CV2) : llcvi - CVall > Vn-} 

where Vn is the separating rate of both hypotheses. In an analogous way as in Section 3, 
the rule for the comparison test would be 

max(y^ 6j^k ~ ij) > 

with 



2 



\ \ ni ni ' \ rii rii 
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where R^,R^,R^,R'^ are the rank statistics associated with X, Y, Z, T. Using the same 
tools as in [2], in which the homogeneity in law of the both samples is studied, it is possible 
to prove that this test is adaptive optimal and that the minimax rate of testing is 



Obviously, all these test procedures could be used in the multivariate framework (d > 2), 
but as usual in the nonparametric context, it will provide slower minimax rates of testing. 



Recall that for any given A G A, IP\ (respectively IPc) denote the distribution associated 
with density cx, respectively with c. In the same spirit, denote also lEx and Var\ (respec- 
tively lEc and Varc) the expectation and the variance with respect to IPx, respectively to 
IPc- When no index appears in IE or in IP it means that the underlying distribution is 
either IPc or IPx. 

8.1 Expansion of the statistics of interest 

Fix a level j in J. For the test problem, the statistic of interest Tj{X) (for A G A) defined 
in (5) is an estimate of 




8 Proof of Theorem 2 



k k 
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where 



9^ 



n2{n2 -, . . 

x{cPj^k{F{Xi,),GiYi,))-Cj^k) 



n2(n2 - 1) . . _ 



il,i2Gl2 
n^«2 



X (F(X,,),G(K,,))-c,-fc(A)) 
= - (0j,fc(i^(^n),G(y.J)-c,-fc)(c,-fc-c,,fe(A)). 

^^i^2 

The sequence {cj^fejj^fc denotes the unknown scahng coefficients of the unknown copula 
density c. RecaU that 

k 

with 

MA) = , ^ E fe(i^(^*J,G(y,,))-c,-fc(A)) 

"2(^2 - 1) ^^^^^^ 

n?^i2 

X fc(F(X,J,G(F,,))-c,,fc(A)). 

The following lemma gives some evaluation for the first moments of each statistic of 
interest. 

Lemma 1. Let q he a positive integer and assume that (f) is continuously q— differentiable. 
Let j be a level smaller than joo defined in (14)- Then, it exists some positive constant k 
which may depend 071 (p, ||c||oo? II'^aIIoo CLTid JM^ such that 



lETj{X) = Tj{\) and VarT,(A) < k |^(^— J + ) Tj{X) 



E\T'*\ < K 



Ui 

E.\TfiX)\ < J'^^TM)T and iE.(i;*(A))2<.2^YM!^y 
\ ni ' J J \ n2ni J 

Using the Bernstein Inequality, we establish the following bound for the deviation of 
the statistic Tj^(A) under the alternative. The proof is postponed to Appendix B. 
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Lemma 2. For any level j, for all x > 



iPc(|r;(A)| >x) <exp(-K{- 



where K is a positive constant depending on L, \\(j)\\oo o,nd ||c||oo- 



Using a result from [15], we establish the following bound for the deviation of the 
[/-statistics Tj{X) and T- . The proof is postponed to Appendix C. 



Lemma 3. For any level j, as soon as x > 2^ rig ^iyiog(log(n2)), for all /x > 0, 
iPA (|f,(A)| > i^ix) + IP, (\Tf\ > fix) < Kg{\og{n2))-^ 



for any positive 6 < fi'^{KgKi)~'^ , where Kg is an universal positive constant given in [15] 
and Ki is a positive constant depending on L, \\(j)\\oo o,nd either ||ca||oo or ||c||oo depending 
on the underlying distribution i.e. either IP\ or IP,. 

8.2 Proof of Relation (16) (First type error) 

Let us fix A G A and set 



PX = max 



inf TA\')-t^ 



> 



Notice that under the null 

r/(A) = Tj{X) = and T,-^ = fj{X). 
Using expansion (18), we get 



Px < j;iPA(mf^r,(A')>t,) 

< E^A(r,(A)>t, 



< E {^A (|f,(A)| > I) + iPA (|T,*| > I) + (|T*(A)| > I) } 
Due to Lemma 1 and using Markov Inequality, we obtain 

Px < } ^IFxi \Tj{X)\ > ^ 1 + > < -—i- + ^ ^ 



,73) {tj/sr 



<_ 2:.,(,^,.,i>|).j:{| 

< E^Afe(A)|>|) 
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Notice that Tj{X) is centered under IPx, then applying Lemma 3, where tj is tj = 
3fi 2^ 71,2 ^-\/log(log (712)), the constant fi is defined in (15) and since card(J) < log(n2), one 
obtains 



Px < Kgcaid{j) (log(7i2))~'^ + ircard( J)2-J" 



712 log("-l) 

Tliy^log log(n2) 



+i^card(J)2-^ 



JO 



\og{ni)nl 



712711 ^log log(7l2) 



< K3(log(n2))^"^ + ir2-J" 



log(7ii)log(rt2) \ 

A/log log(7l2) / ' 



where the last inequality holds since 5 satisfies 5 < p?{2KgKi) ^ (see Lemma 3). Since ^ 
is such that /i > y^ZK^AY, relation (16) is proved if one takes 5 = ii'^{2KgKi)-^ . 

8.3 Proof of Relation (17) (Second type error) 

Let us fix T G 5 and c E T[Av^^-\{t)) and set 



Pc = IPc ^max mf rj(A) - tj < 0^ . 
Using the expansion (18), we get, for any j* G J 

Pc < IPc (inf {2T;. (A) + T,(A) + T^. + T* + 2T*(A)} < t,.^ 

< IPc (^inf { 2r;. ( A) + Tj* ( A) } < 2t,. ^ 
+IPc (rf. + T* + 2inf {t*(A)} > tj^j 

< IPc (^inf { 2r;. ( A) + T,* ( A) } < 2tj* ^+IPc(t^.> tj* /s) 
+IPc (t* > tjV3) + finf {r*(A)} > tj./6^ 



= Pci{f)+Pc2{r)+Pc3{r)+Pc4{r). m 

Let us explain how j* is chosen. From the wavelet expansion (1) and Lemma 1, one has 

lEcT)*{X) = Tj*{X) = cxf - Bj*{X), 

where Tj*, Bj* are defined in (2) and t* is the critical value given in (15). Since c is in 
T{Av^^-i{t)) and cx lies in 6s^,p^,oo(AfA) C 6s,p,oo(M), the function (c-ca) is in 6s,p,oo(^)- 
We can choose j* such that 



2^ 



\ 1/(2^+1) 
K 712 \ 



3/X y/log log(7l2) 



20 



which is possible due to our choice of joo and because s > 1/2; the constant K appears 
in (3). It imphes that Bj* < tj* since Bj* < K2-2i*« (see Inequahty (3)). Next, since 
c G r(A'y^^-i (r)), one has /(c — cy)'^ > A'^{v^^-i{t))'^ for all A' G A. Focusing on rates 
7;^j-i(r) combined with positive constant A which satisfy Atj* < (Av^^-i (r))^, one obtains 



< 1/3. 



(20) 



Coming back to the evaluation of the probability terms (see relation (19)), we first consider 
Pci(j*)- Consider an rj-net on the set A that is for any A in A, denote A the closest (in 
the Euclidean sense) element in A^ to A (closer than ry). Due to assumption AO, let us 
prove that for any j £ J, Tj{X) + Tj{X) is close to r?(A) + Tj{X): 



\Tnx)-Tnx)\ 



1/2 



n2 



In the same way, one has, 

|r,(A)-T,-(A)| < AtQr?'^ (2max(||c||,||c^||) + Qr7''). 
Choosing tj = with bu > 1, then by (20) and applying Lemma 2, we get 
Pciif) < Yl JPc{2T^X)+T,.[X)<2t,.) 

n ^ 

< ^c(Tjl(A)<-T,-.(A)/6) 

n " 

n2 1/2 



1/2 



< 



< 



< 



K2 ( n2tj* A ^t-i 



n 



exp 



-A-3 (2^"(loglog(n2))i/2 
n2 \V2 



A 



2iV2 



(loglog(n2))i/^ 



(21) 



where D{A) is the diameter of A and K, K2 and are positive constants. Both terms 
behind the minus sign in the exponential of the right hand side of the last inequality tend 
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to infinity with a power of n since s > 1/2. Tliis implies that Pci{j*) goes to zero as n 
goes to infinity. 

Now, it remains to verify that Pc2ij*),Pc3ij*) and Pd^j*) are going to zero as ni A 77-2 
goes to infinity. Using again the bound (20), Lemma 3 for some positive 5, Lemma 1 and 
the definition of the critical value (15), one gets 

Pc2{j*) + PcsU*) + PdU*) 



< IPc [t^* > ijVs) + +lPc (t* > tj*/3) + IPc (t*(A) > ijV6 



< Kg{login2)y' + 9 ^ + 6 



iE,|r*(A)|2 1E,\T*\ 
45 h 6 — 



^ -u- n ( w-S , n 2^*log(ni) log(ni) 
< Ag(log(n2)) +9k— — "2 h 6k- 



nin2 tj* ni tj* 

< K,(log(,.,))- + 6k f ) , (22) 

\ J^i Vloglog(n2)y 

which tends to zero with our choice of j* and where k is the positive constant appearing 
in Lemma 1. Inequahties (21) and (22) entail that the right hand side of (19) is less 
than any a G (0, 1) as n is large enough. To finish the proof, observe that the choice 
of v^^-i{t) is driven by the fact that it corresponds to the smallest sequence such that 



4t* < (Au -i(t))2, which leads to 



/ ■* v^log(log?i2) \ / n2 



1/2 / \ 2s/(4s+2) 



"2 / \ Y^log log (712) 



9 Proof of Theorem 1 



Without loss of generality, we suppose that the support of the scaling function (p and its 
associated wavelet function ip is [0, 1]. Moreover recall that Jq 4^'' = 0. Let us give some 
a > which must be small enough. 



9.1 Discretisation of S 

For any given r = {s,p,M) G S, denote by j(r) the level 

2J-W = (nt-i)2/(4«+2) 

and define sj the solution of the equation j = j{sj,p, M) for any resolution level j £ J = 
{j«max' • • • '-^'^min^ ^ ^-^0' • • • ' ^^^^ 

J'^max = U(smax,P,Af)J and j^^.^ = [j{s^^^^^,p, M)\. 

Consider now the set Sn = {tj = {sj,p, M),j £ J} which appears as a discretisation 
version of a subset of 5 whose cardinality is of order 0(log(n)). 
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9.2 Prior and parametric family included in the alternatives 

For any Sj £ 5„, define a prior ttj wliich is concentrated on the class of tlie random 
functions 

3 



k e=l 



where caq is defined in assumption AInf and 

P{5k = 1) = P{h = -1) = 1/2 and Uj{n) = CiM{nt-^) ^^=^+2 

for Ci such that 3M^C^ = 2a^. Let j be any index in J. Since j 'tp = and when a is 
small enough (to guarantee that Cj > 0), Cj is a density. Easy calculations imply that 

lie, - cjp = M^Cf K,-i(r,))2 > ^2 (^^^_^)2, 
Moreover, if a is small enough, we have 3 < 1 and 

2i{.,+i-2/p)p ^ ^ I / c.v^jjp = 2^'(''^+^-2/p)p ^ Yl i%(")r 

k (- k e 

= SC^MP < MP, 

implying that cj G bs-^p^oo{M). Denote by Aj^n{a) the set of densities 

= {c G ^,p,oo(M) : inf ||c - ca|P > a^(i;„^-i (r,-))^}. 

AG A 

and consider the variation between both distributions IPx„ and IPji 



Var{lPx„IPu) 



where 



dIPxo Nn 



-y 



1 



dlPi 



dlPn 



dip 



Ao 



dIPx, Af„^_ "^^CAo' 



and iV„ = card (J). Assuming that the following assertion holds 

lim inf Tryfc G ^, „(a)) = 1, 



(23) 



we deduce that the left hand side {LHS) of relation (13) without the limit is bounded 
from below by 

LHS > iPA„(A. = l)+ sup sup lPc{Dn = 0) 

Tj€Sn ceAj.n{a) 

> l-Var{IPx„IPu)il + On{l)), 

as n large enough. Since the supports of the functions cj and cji are disjoint for j / j' , 
one has 

\ 2 " 



l-Var{IPx„lPn) > 



2 m 



n 



\ cxo{Ui,Vi) 



dTTjiCj] 



1 



> l-o„(l) 
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provided that 

2 



n . , 



(24) 



Relation (13) is thus proved if (23) and (24) are satisfied. The remaining proofs are given 
in the sequel. 

9.3 Proof of Relation (23) 

Let A' be a subsect of A. We have 

(inf \\cj - cxf < a^{Vr,t-^{Tj)f^ < 7Tj (^Jnf^, " caIP < (^-j))' 



+ ^j[j^flM,-cxf<a\v^,-^{T,)fj (25) 

Consider the particular subset A' defined by 

A' = {A e A : llcAo - cxf < 6CfM'{v^^^-^{T,)f}. 

Notice that 

XeA/A'^\\c,-c,f>a\v^,-.{T,)f 
due to the choice of Ci. It implies that the first term in the right hand side of (25) is null 



and then, it remains to prove that 

lim TTj ( inf ||c,- - ca||^ < a^iv ,-i{Tj)f ] = 0. 
Since A is in A', we get 

> 3CfM' {v^^^j {T,)f + 2 ^ 5kU,{n) B,,k,XM, 

k e 

where 

Bj,kXXo = j '0],fc(cAo - ca)- 
Therefore assertion (26) is equivalent to 



(26) 



or 



lim TTj sup 2y^{-5k)Uj{n)Bj^k,XM ^ o^(^nt~i ('^j))^ = 0. (27) 

n^oo \^AeA' ^ " / 
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We can construct in the Euclidean metric an ij-net A'^ on the subset A'. For any A in 
A', denote A the closest element in A^ to A in the Euclidean sense. Then for any A G A', 
we have by assumption AO: 



k k 



||oo 
k 



where k is a positive constant depending on Q, Ci, M and ||V'*'||oo- Chosing rj = n~'', with 
bi^ > n/'"'"1i , the proof of relation (27) is then reduced to the proof of 

Jim Card(A:,_Ovr, (^2 ,,x,Ao > = 

limjTnr^TT, (^2Y,{-Sk)nj{n)B^^^^^^^^^^ = 0, (28) 

where Diam is the diameter of A. Finally, relation (26) is proved applying Bernstein 
inequality in the right hand side of relation (28). Indeed Bernstein inequality is applied to 

TT, (^2^(-5fc)n,(n)S.,3^,^ >a2(^^,_.(r,))2j, 

with the i.i.d. centered random variables Zk = ~^kBj ^ In particular, Notice that 
\Zk\ < Kiv^^-i{Tj), X]fcVar(Zfc) < K2{v^^-i{Tj))'^ , where Ki and K2 are positive con- 
stants. Notice also that it leads to an exponential bound of order exp(— 2-^). 



9.4 Proof of Relation (24) 

Set 



Cj{Ui, Vi] 



-d-Kj{Cj). 



Due to the fact that the functions i/^j ^. have disjoint support, it is possible to rewrite Cj as 
follows 

k 



for 



Dj,k = Uj[n)2^—. 
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Then, 



and 



/n 

K i=l 

if" " 1 

= n 2 11(1 + D,^k{U^, Vi)) + - D,,k{Ui, V,)) , 

k U=l i=l ) 

= \{JV\{[^ + ^lk{U^,V,)\+2Y{[l-Dl^{U,,Vi)\ 



1=1 



i=l 



+H {D,^kiU^,Vi),[Df,^{Ut,Vt: 



<€{!,. ..,i-l,i+l,...,n} 



where bt is either or 2. Due to the independence of the data and acting as in [23], it can 
be shown that 



H[D,,k{U^,Vi)^D'%{Ut,Vt) 



0. 



t£{l,...,i-l,i+l,...,n}^ 

Therefore, 

iBAo[C(f^-^0] < ll{{l + lEx,Dl,{U„Vi)y + {l-E,,Dl,{U„V,)y} 

k 
k 

Using the inequahty log (cosh (n)) < Ku'^ where K is a fixed constant and since caq is 
bounded from below by m, one obtains 

^^exp(log(iEAo/„,02) < ±Y^explKn^^{lE^,Dl,{U,,Vi)A 



< 



log(n)'' 



log(n)(l + o„(l))' 



where k = K{3CfM'^)'^m ^ = AKa'^m ^. Choosing a small enough and k < 1, Relation 
(24) is then proved. 

10 Appendix A: Proof of Lemma 1 

In this part, k denotes any positive constant which may depend on (j), M and on ||c||, ||ca||. 
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10.1 Notations and Preliminaries 



Let us define or recall some notations that will be used below. For any k E 2L^, set 

Ck{Xi,Yi) = cl),^k{F{Xi),G{Yi)^ -cl),^k{F{Xi),G{Yi)) 

ioldX^,Yi) = 0j-fc(F(X,),G(y,))-Cj,fc(A) 
cvJ^,{Xi,Yi) = ^,,k{F{Xi),G{Y,))-cj,k, 

where i is in X2. First, the localization property of the scaling function implies that only 
few will be used since the others are zero. Indeed, one has the following result 

Lemma 4. For any k G 7L^, let us denote 

Nj = card {iel2;^k{Xi,Y,)^0}. 
Let 6 > 0. For any level j such that 



2^ < 



2 / n, 



one has 

F{Nj > 2(2L + 3)n22~J')) < K{n:[^ + n^^). 

We refer to [12] for the proof of this lemma since a similar result is established with an 
estimate F built on the whole sample: it guarantees in particular that -F(X(j.„)) = i/n, 
where denotes the i—th (among n) order statistic. In our case, the situation is 

different since is based on the observations lying in the subsample whose indices 

are in Xi whereas it is calculated in an observation lying in the subsample whose indices 
are in I2 ; nevertheless, applying the Dvoretsky-Kiefer-Wolfovitz Inequality, the following 

> 2e I is bounded 



deviation inequality holds. For any e > 0, = P ^ 
from above by 



_2_ 

n2 



< 



< 



F{X^i,n2)) - ^(-^(j;n2)) 
|i^-i^||oo +1 



i^-i^lloo > e 



F{X(^i,n2)) - F{X(^i.n2)) 



> e 



< K ( nf'^ + n^^ 



as soon as we take e = ^y 6 log{ni) / {2ni) V 6 log{n2) / {2n2) ■ Here F represents the 
empirical margin computed with the subsample whose indices in Ii and F, the empirical 
margin computed with the subsample whose indices in X2. 
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10.1.1 study of T,(A) 

Rewrite Oj^kW in Tj{X) = X]fc0j,fc(A) as follows 



1 



^2(^2 - 1) 



il7^i2 



For all i G 2'2, one has lE{ujjf^{Xi,Yi)) = cj^k — Cj,fc(A), which implies that 

k 

Moreover for p ^ k, one obtains 

iE(^(A)%;(A)) 

^ ^ ]E[Lulk{X,,,Y,,)] Ey,^{X,,,Y,,)] ]E[ijlk{X,,,Y,,)^ ^ 



(712(712 - 1))2 



+4 



(712(^2 -1))2 



< ^j,fc(^)6'i,p(A) + — (cj,fc - Cj,fc(A)) (cj,p - Cj,p(A)) 

712 



+ - 



112(7*2 - 1) 

which implies that 



<t>j,k - / <^j,fcCA 0j 



</'j,fc - / 0j,fcCA ) ( (l)j,p 
2 



pCA C 



Var(T, (A)) = 55 J^^^ e,,,(A) j - |^I?^e,,,(A) 



+ 



712(712 - 1) 



E 

k ,p 



kCx (Pj,p 



Applying the Holder Inequality and the consequence of the Parseval Equality, we get 

E 

kp 



•j,k - / 4>i,kCx (flj.p ~ / (t>j,pCX c 



2M E / (t>j.k(i>j.pc +2E / '^j-''^^ / <^j'P^ + (^E / 



(t>j.kC\ 



< 2^(^(^/0,V) I cl+[l cl) )<.2^^ 

We conclude that 



Var(rj(A)) < K 



1/2 



V 



4 

n2 



22i 



k,p 



^2(^2 - 1) 



2^ 2^-'' ■ 

< ^(_r,.(A)+ ,, 



^2 
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which is the announced result for Tj{\). 

10.1.2 Study of T* and r*(A) 

Let us denote 



k,p 



k 

We need the following results which are stated in the lemma below 

Lemma 5. Assume that the scaling function is q-differentiahle. For any level j < joo, 
there exists some positive constant k depending on (j), its derivatives and 

ll^^lloo (which 

might be ||ca||oo for some A G such that for any distinct indices ^1,^2? one obtains 

1E\A,\ < -(^)''' (29) 

< i^|C,,,2l<.(^) (30) 



V 111 

We prove relation (29) in the next section, relations (30) are proven in [12]. We have 



^2(n2 - 1) ^^^^^^ 
Using Lemma 4 and Lemma 5, it follows 



mTf\ < ^i-— (n,2-f2^^Yi^^)<fM!^ 
J n2(?^2 - 1) V "1 / V 

Moreover, we get 

Tf{X) = E E[^^(^n,^n)^^,.(^.2,^.2; 

ii ,126X2 k 

By Holder Inequality and from lemmas 4 and 5, one obtains 

1 / \^^^ 

- n.in. - 1) E E(^(^^))'E(^-i.^(^-^-))1 

^ ' ii,i2el2 \ k k / 

ilf^i2 
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Remembering that ]Eujjj^{Xi^,Yi^) = {cj^k — cj^kW) ^ov any index i2, we get 



1E\T*{X)\ < ^ 

n2[n2 - 1) 



(n22 ■^)n2 



22i 



log(ni) 



1/2 



ni 



Let us study the moments of T*(A) under iP;^. Since IExujjj^{Xi,Yi) = for any k and i 



we obviously have IE\T*{\) = and 



JEx{Tf{\)? 



il,i2,«3) 



where 



5] (i^A a.j^,fc(Xi„y,,)u;<^_p(x,„y,, 



k,p 



5, 



By Holder Inequality, we have 

k,p 

I n\ 

^ E (^^ \^lki.^^2.y^2)^]M-2.y^2 

\ k,p 

With Parseval Equality, we get 

(^A [oik {Xi2 , Yi, )u;lp{Xi, , y,, )] ) ' < Y^fJ clyj,k(p,,pcx) 

k,p k,p 



which combining with Lemma 5, implies that 

2\ 1/2 



T^,^i2 < K 2''^ 



log(ni) 



In the same way, 



ni 



1/2 



1/2 



«1,«2,«3 



< K Y^IExQ,,, 

\ k,p 



k,p 
2\ 1/2 

< 23J 



log(?ii) 



ni 
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From Lemma 4, one has 



n^(n2 - 1)^ V ^1 / 



< K2^ 



log(^i) 

712711 



10.2 Proof of Lemma 5 

The following expansion is crucial because it allows to reduce the study to univariate 
variables. 

UX^,Yi) = ikAX^)^k,{Y^) (31) 

where the univariate statistics ^k^{Xi) and ik2iYi) are defined as follows 

ik^y^) = <PiM{^^^-HM{G{Y,)). 
Assuming that cj) is continuously g— differ entiable, we get 



where 



Zk 



and 



Jf{x,) 



A direct application of the Dvoretsky, Kiefer and Wolfovitz Inequality leads to the following 
bound 

IP{\\F - F||oo > e) < A" exp(-2nie2) < Kn^\ 



as soon as e = 5\og{ni) /ni. In the sequel, we take such an e with 8 large enough. 

Since j < joo where joo is defined in (14), observe that 2^e < 1 and then we get 



l,...q- 

\wkAXi)\ < i^2('^+i/2)Je'^(l + op(l)) 



\zkAX,)\ < K2^e m^^ \<pfl{F{X,)m + op{l)) 

£=!,.. .0—1 
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which leads to the following bound 

< 7^ (^2('+^/^)^.' + 2^.^^max_^ (l + op(l)). 

The same kind of result obviously holds for ^f^^iYi). In the sequel, we need the following 
evaluations (which also hold for any derivatives of (j)). Using expansion (31), we get 

Ck{Xi,Yi) = S1 + S2, 

where 

51 = ^kr{Xi)^k,{Yi}, 

52 = ikAXMiM {G{Yi)) + ikM)(t>,M (HXi)) ■ 

Using (32), we get 

]E\S2\ < i^(2«e« + e). 
If 2i < (ni/log(ni))i/2"i/2«, we obtain IE\^k{Xi,Yi)\ < e which ends the proof. 

11 Appendix B : Proof of Lemma 2 

Let us denote r?(A) = ^2 ^ Sigja where 

= XI (^jkiP{Xi),G{Yi)) - Cjk) {cjk - Cjfc(A)) , 

k 

JEcZi = 0, 

\Z^\ < ( J^(.^,fc(F(X,),G(yO)-Cjfc)2 J^(cjfc-c,fc(A))M 

\ k k / 

1 /2 

< Ki ( (2^)^ (A)) < K 2^Tj (A) , 

and 

Vare(ZO < J] iB(0,fe(F(Xi), G(y,)) " c^fc) (0jp(F(XO, G(yi)) " Cjp) 

k,p 

X |(cjfc - Cjfc(A)) (cjp - Cjp(A))| 

k,p 

X \ {cjk - Cjk{X)) {cjp - Cjp{X))\ 

< l|c||oo (cjfc - Cjfc(A))^ = ||c||oo Tj(A). 

it 

Applying Bernstein Inequality to the Z'^ s leads to prove Lemma 2. 
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12 Appendix C: Proof of Lemma 3 

12.1 ^/-Statistic 

Let us first recall the result of [15]. 
Proposition 1. (THEOREM 3.3 P. 21 [15]) 

It exists an universal positive constant Kg < oo such that, if Q. is a hounded canonical 
kernel of two variables for the i.i.d. Zi^, Zi^, ii, Z2 G {l, • • • where n G IN, for any 
X > 0, we have 

IP I I ^0(Z,,Z,)| > .] < K.exp (-1 minjl^, ^, (|)'^' , (^)'^'} 



where 



A = \\n{-,-)\\^,B^ = fi[\\iE[n\Zi,-)]\\oo + \\iE[^H-,Z2)]\\oo] 

= n^IE[{n{Zi,Z2)f] 



and 



D = n sup {iE[n{Zi,Z2)ni{Zi)n2{Z2)] -. HnHzi)] < i;iE[nl{Z2)] < i}. 

We apply this proposition for Zi = (F{Xi), G{Yi)), h = n2 and the kernel 

{Zi,,Zi,) = {'Pj,k - m<Pj,k (Zi,)]} X {0,- (Z,,) - IE,[<Pj^k {z^,)]} , 

k 

which is considered under the distribution IPc where c is either cx or c. The quantities A, 
B, C and D are evaluated in the following lemma which is proved in the next section. 

Lemma 6. There exists some positive constant Ki larger than either 

{UL^UWD V (2||c||oo) V (2L2||0||y V (4||c||oo(||c||oo + ^L^UWlo))) 

such that 

A < Ki2^\ B"^ < Ki n22^\ < Ki n\i}^ , D < Ki ns, 
where c is either c\ or c. 

Again define c as c\ or c, then applying both the result of [15] and Lemma 6, for any 
level j and any x > 2^ {(n2 — l)?T'2)~"'^^^\/log (log (77,2)), it immediately follows that 



( I 

1—7- V ne{Zi,,Zi^)\> fix 

n2[n2 - 1) . 

\ n#!2 / 



< Kg exp {-S log(log(n2))) . 



which ends the proof of Lemma 3. 
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12.2 Proof of Lemma 6 

Let us denote {U,V) = {F{X),G{Y)) any pair of random variables whose marginal dis- 
tribution are both uniform on [0, 1]. Denote c the copula density which is cx or c; in the 
same spirit, the coefficients cj^k stand for cj^kW or cj^k- Recall that 

c,-fc(A) = IEx[<Pj,k{F{X),GiY))] = j cx{u,v)(l)j^kiu,v)dudv. 

Cj^k = IE[(t)j^kiF{X),G{Y))] = c{u,v)(t)j^k{u,v)dudv. 



Notice that 



Y,m<Pj,k{Un,Vi,)cl),,p{Ui„Vi,)] < 



k,p 



We get 



k 

^ \\'^<Pj,k{ui,Vl)(l)j,k{u2,V2)\\oo + 2\\'^(i)j^k{ui,Vl)]Ec[(l)j,k{U,V)]\\oc 
k k 

+ 11 



< L^2^^\\4>\\l + 2L' 



c\\22^ + llcll^ < K2'^^, 



oo||<-||2 



where K > 2L2||0||2^ and 



2n 



IE, [{<PJ,k{u^,,v,,) - mhkiu, V)]) {<p,,p{Ui„Vi,) - m<PjAu, V)])] 

k,P 

X {4>jAu2,v2) - iE5[4>j,k{u,v)]) {4>jAu2,v2) - iE,[(t>j,p{u,v)])\\^ 



< 2n2 



E 

k,p 



Vj,k(Pj,pC ■ 



kC / (pj^pC 



{^j,k(.U2,V2) - m^j,k(.U,V)]) 



X icPj,piu2,V2)-m^jAU,V)])l 



< 2n2(2||c| 



^ <Pj,ki^2, V2)4>j,p{^2, Z2) 



k,P 



^ (l}j,k{'^2,V2)lEc[(pjAU, V)] 
k,p 



+ 



Y,m^j,k{u,v)]m^j,p{u,v)] 

k,P 

< i4n2\\c\U{2'^L^2\\m + 2L'-^ 



+ 2^^\c\\oo) <Kn22^^ 
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where K > 4||c||oo(||c||oo + 3L^||</>||^). Moreover, 

C2 = nlJ2 [{ct>j,k{Uh,Vi,) - m^P.MU, V)]) {^j,p{Ui„Vi,) - E,[cl)j^p{U, V)])] 

k,p 

= nlY, i^c [(pj,kiUh , Vi,)(Pj,p{Ui, , Vi, )] - IE, [<P,^k{U, V)] ^(C/, V)]f 



k,p 



k 

< \\c\\lnl2^^ +nl\\c\\i<Knl2^^, 
where K > 2||c||^. Denote MQi,r22 = ^c[f^c(-2^i5 ■^2)^i,c(^i)^2,c(-^2)] and for i = I, 2, put 



By Holder Inequahty, we get 



y k k 



Applying again the inequality of Holder to X]fc(ci(A:))^ (the same occurs for C2{k)), one 
gets 

y'(ci(A;))^ < / {<j)j^k - ^c4>j,k{U,V))ni^cCl,k, 2iV-l+fci, .fe2 2N-l+k2 .f 

k k 

I / (^l.c)^ Iffel 2£-l + fci i„rfc2 2i-l + fe2l C ) 
\J ' '23 • 23 1^^2J' 23 < / 

< ||c||oo / (f^l,c)^C > Iffcl 2L-l + fci i„rfc2 2L-l+fc2 i 

<12||<AIIL^^ 

since iBa(Oi,c(?7))2) < 1. It follows that D < K n2, where K > 12L^\\(j)\\l^. 
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Copula 


True copula 


r = 0.25 


r = 0.50 


T = 0.75 


under Hq 










Gumbel 


Clayton 


1.00 0.0000 (0.72) 


1.00 0.0000 (0.99) 


1.00 0.0000 (1.00) 




Gumbel 


0.05 0.0105 (0.05) 


0.01 0.0049 (0.05) 


0.00 0.0000 (0.05) 




Frank 


0.31 0.0207 (0.15) 


0.36 0.0214 (0.40 ) 


0.24 0.0191 (0.84) 




Normal 


0.25 0.0195 (0.10) 


0.25 0.0194 (0.18) 


0.09 0.0iS7 (0.61) 




Student (4) 


0.99 0.0035 (0.14) 


0.89 0.0139 (0.22) 


0.62 0.0218 (0.55) 


Clayton 


Clayton 


0.05 0.0101 (0.05) 


0.51 O.O^S^ (0.05) 


1.00 0.0000 (0.05) 




Gumbel 


0.99 0.0028 (0.86) 


1.00 0.0000 (1.00) 


1.00 0.0000 (1.00) 




Frank 


0.97 0.0079 (0.56) 


1.00 0.0000 (0.96) 


1.00 0.0000 (1.00) 




Normal 


0.77 0.0190 (0.50) 


0.85 0.0178 (0.93) 


1.00 0.0000 (1.00) 




Student (4) 


0.45 0.0223 (0.56) 


0.14 0.0157 (0.95) 


1.00 0.0000 (1.00) 


Frank 


Clayton 


0.99 0.0022 (0.40) 


1.00 0.0000 (0.89) 


1.00 0.000 (0.97) 




Gumbel 


0.19 0.0175 (0.33) 


0.23 0.0190 (0.63) 


0.22 0.0i5^ (0.82) 




Frank 


0.05 0.0108 (0.05) 


0.01 0.0055 (0.05) 


0.00 0.000 (0.05) 




Normal 


0.14 0.0155 (0.08) 


0.39 0.0218 (0.20) 


0.72 O.O^Oi (0.41) 




Student (4) 


0.95 0.0096 (0.18) 


0.83 0.0167 (0.08) 


0.92 0.0121 (0.06) 


Normal 


Clayton 


0.97 0.0076 (0.31) 


1.00 0.0000 (0.80) 


1.00 0.0000 (0.92) 




Gumbel 


0.19 0.0176 (0.24) 


0.13 0.0151 (0.38) 


0.01 0.00^0 (0.38) 




Frank 


0.23 0.0190 (0.08) 


0.35 O.OSi^ (0.20) 


0.60 0.0219 (0.42) 




Normal 


0.05 0.0099 (0.05) 


0.01 0.0045 (0.05) 


0.00 0.0000 (0.05) 




Student (4) 


0.87 0.0149 (0.10) 


0.22 0.0i55 (0.08) 


0.08 0.0120 (0.06) 


Student (4) 


Clayton 


0.71 0.0204 (0.27) 


1.00 0.000 (0.77) 


1.00 0.000 (0.93) 




Gumbel 


0.98 0.0056 (0.19) 


0.74 0.0331 (0.34) 


0.29 O.O^OS (0.42) 




Frank 


0.28 0.4485 (0.09) 


0.80 O.Oi<^g (0.27) 


0.02 0.0061 (0.41) 




Normal 


0.84 0.0166 (0.05) 


0.20 0.0178 (0.04) 


0.03 0.0075 (0.04) 




Student (4) 


0.03 0.0074 (0.05) 


0.01 0.005^ (0.05) 


0.00 0.0000 (0.05) 



Table 1: umc = 500, = 20, n = 2048, nn = 2048. Seed 1. Empirical power for the test 
Hq : c = cao at the given level a = 10% where caq is specified in the first column and the 
data are issue from a copula density specified in the second column. The parameter of each 
copula density is chosen such that the Kendall's tau is respectively r = 0.25, 0.50, 0.75. 
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Family 


parameter grid 


Cardinal 


a 


Decision 


Gumbel 


1.05 : 0.1 : 1.95 


10 


0.00 


Yes 


Gaussian 


0.0 : 0.1 : 0.9 


10 


0.04 


Yes 


Clayton 


0.5 : 0.1 : 1.4 


10 


0.42 


Yes 


Frank 


1.5 : 0.5 : 6.0 


10 


1.00 


No 


Gumbel 


1.0 : 0.05 : 1.95 


20 


0.00 


Yes 


Gaussian 


0.0 : 0.05 : 0.95 


20 


0.00 


Yes 


Clayton 


0.5 : 0.05 : 1.45 


20 


0.54 


No 


Frank 


1.25 : 0.25 : 6.0 


20 


1.00 


No 


Gumbel 


1.45 


1 


0.10 


Yes 


Gaussien 


0.48 


1 


0.12 


Yes 


Clayton 


0.92 


1 


0.86 


No 


Frank 


3.20 


1 


1.00 


No 


Gumbel 


1.36 (1.07%) 


1 


0.02 


Yes 


Gaussien 


0.45 (3.27%) 


1 


0.08 


Yes 


Clayton 


0.41 (13.15%) 


1 


0.62 


No 


Frank 


2.88 (3.93%) 


1 


1.00 


No 



Table 2: Empirical probability a to reject the fit to a fixed parametrical family given in 
the first column and Decision at the prescribed level a = 5%. Multivariate null hypotheses 
(first and second part); Hq : c = Cj^, where A is obtained by inversion of the empirical 
Kendall's tau (third part); Hq : c = c^, where A is obtained by minimizing the ASE 
quantity which is given into brackets (fourth part). 
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